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Abstract 


Error-correcting codes are efficient methods for handling noisy communication channels 
in the context of technological networks. However, such elaborate methods differ a lot from 
the unsophisticated way biological entities are supposed to communicate. Yet, it has been 
recently shown by Feinerman, Haeupler, and Korman [PODC 2014] that complex coordi¬ 
nation tasks such as rumor spreading and majority consensus can plausibly be achieved 
in biological systems subject to noisy communication channels, where every message trans¬ 
ferred through a channel remains intact with small probability ^ + e, without using coding 
techniques. This result is a considerable step towards a better understanding of the way 
biological entities may cooperate. It has nevertheless been established only in the case of 
2-valued opinions: rumor spreading aims at broadcasting a single-bit opinion to all nodes, 
and majority consensus aims at leading all nodes to adopt the single-bit opinion that was 
initially present in the system with (relative) majority. In this paper, we extend this previous 
work to /c-valued opinions, for any constant k > 2. 

Our extension requires to address a series of important issues, some conceptual, others 
technical. We had to entirely revisit the notion of noise, for handling channels carrying 
k-valued messages. In fact, we precisely characterize the type of noise patterns for which 
plurality consensus is solvable. Also, a key result employed in the bivalued case by Feinerman 
et al. is an estimate of the probability of observing the most frequent opinion from observing 
the mode of a small sample. We generalize this result to the multivalued case by providing 
a new analytical proof for the bivalued case that is amenable to be extended, by induction, 
and that is of independent interest. 


1 Introduction 

1.1 Context and Objective 

To guarantee reliable communication over a network in the presence of noise is the main goal 
of Network Information Theory |18| . Thanks to the achievements of this theory, the impact of 
noise can often be drastically reduced to almost zero by employing error-correcting codes, which 
are practical methods whenever dealing with artificial entities. However, as observed in [19] . the 
situation is radically different for scenarios in which the computational entities are biological. 
Indeed, from a biological perspective, a computational process can be considered “simple” only if 
it consists of very basic primitive operations, and is extremely lightweight. As a consequence, it 
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is unlikely that biological entities are employing techniques like error-correcting codes to reduce 
the impact of noise in communications between them. Yet, biological signals are subject to noise, 
when generated, transmitted, and received. This rises the intriguing question of how entities 
in biological ensembles can cooperate in presence of noisy communications, but in absence of 
mechanisms such as error-correcting codes. 

An important step toward understanding communications in biological ensembles has been 
achieved recently in |19) . which showed how it is possible to cope with noisy communications in 
absence of coding mechanisms for solving complex tasks such as rumor-spreading and majority 
consensus. Such a result provides highly valuable hints on how complex tasks can be achieved 
in frameworks such as the immune system, bacteria populations, or super-organisms of social 
insects, despite the presence of noisy communications. 

In the case of rumor-spreading, |19| assumes that a source-node initially handles a bit, set 
to some binary value, called the correct opinion. This opinion has to be transmitted to all 
nodes, in a noisy environment, modeled as a complete network with unreliable links. More 
precisely, messages are transmitted in the network according to the classical uniform push model 
[Ill[23l[28] where, at each round, every node can send one binary opinion to a neighbor chosen 
uniformly and independently at random and, before reaching the receiver, that opinion is flipped 
with probability at most ^ — e with e > 0. It is proved that, even in this very noisy setting, the 
rumor-spreading problem can be solved quite efficiently. Specifically, m provides an algorithm 
that solves the noisy rumor-spreading problem in 0(^ logn) communication rounds, with high 
probabilitj0 (w.h.p.) in n-node networks, using 0(loglogn-|-log(l/e)) bits of memory per node. 
Again, this algorithm exchanges solely opinions between nodes. 

In the case of majority consensus, |19) assumes that some nodes are supporting opinion 0, 
some nodes are supporting opinion 1, and some other nodes are supporting no opinion. The 
objective is that all nodes eventually support the initially most frequent opinion (0 or 1). More 
precisely, let A be the set of nodes with opinion, and let b G {0,1} be the majority opinion in 
A. The majority bias of A is defined as — |A 5 |)/|A| where Ai is the set of nodes with 

opinion i € {0,1}. In the very same noisy communication model as above, |19) provides an 
algorithm that solves the noisy majority consensus problem for |A| = logn) with majority- 
bias n(y^logn/|A|). The algorithm runs in 0(^ logn) rounds, w.h.p., in n-node networks, using 
0(log log n-|-log(l/e)) bits of memory per node. As for the case of rumor spreading, the algorithm 
exchanges solely opinions between nodes. In fact, the latter algorithm for majority consensus is 
used as a subroutine for solving the rumor-spreading problem. Note that the majority consensus 
algorithm of |19| requires that the nodes are initially aware of the size of A. 

According to |19j . both algorithms are optimal, since both rumor-spreading and majority con¬ 
sensus require Il(^logn) rounds w.h.p. in n-node networks. 

Our objective is to extend the work of |19) to the natural case of an arbitrary number of opinions, 
to go beyond a proof of concept. The problem that results from this extension is an instance 
of the plurality consensus problem in the presence of noise, i.e., the problem of making the 
system converging to the initially most frequent opinion (i.e., the plurality opinion). Indeed, the 
plurality consensus problem naturally arises in several biological settings, typically for choosing 
between different directions for a flock of birds |10| , different speeds for a school of fish |30) , or 
different nesting sites for ants |21) . The computation of the most frequent value has also been 
observed in biological cells m- 

series of events n > 1, hold w.h.p. if Prffn) > 1 — 0(l/n'^) for some c > 0. 
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1.2 Our Contribution 


1.2.1 Our Results 

We generalize the results in |19| to the setting in which an arbitrary large number k of opinions 
is present in the system. In the context of rumor spreading, the correct opinion is a value 
i G /c}, for any constant k > 2. Initially, one node supports this opinion i, and the 

other nodes have no opinions. The nodes must exchange opinions so that, eventually, all nodes 
support the correct opinion i. In the context of (relative) majority consensus, also known as 
plurality consensus, each node u initially supports one opinion G {1,..., A:}, or has no opinion. 
The objective is that all nodes eventually adopt the plurality opinion (i.e., the opinion initially 
held by more nodes than any other, but not necessarily by an overall majority of nodes). As in 
|19j . we restrict ourselves to “natural” algorithms |12) . that is, algorithms in which nodes only 
exchange opinions in a straightforward manner (i.e., they do not use the opinions to encode, e.g., 
part of their internal state). For both problems, the difficulty comes from the fact that every 
opinion can be modified during its traversal of any link, and switched at random to any other 
opinion. In short, we prove that there are algorithms solving the noisy rumor spreading problem 
and the noisy plurality consensus problem for multiple opinions, with the same performances 
and probabilistic guarantees as the algorithms for binary opinions in |19| . 

1.2.2 The technical challenges 

Generalizing noisy rumor spreading and noisy majority consensus to more than just two opinions 
requires to address a series of issues, some conceptual, others technical. 

Conceptually, one needs first to redefine the notion of noise. In the case of binary opinions, the 
noise can just flip an opinion to its complement. In the case of multiple opinions, an opinion i 
subject to a modification is switched to another opinion i', but there are many ways of picking 
i'. For instance, i' can be picked uniformly at random (u.a.r.) among all opinions. Or, i' could 
be picked as one of the “close opinions”, say, either f +1 or f — 1 modulo k. Or, i' could be “reset” 
to, say, i = 1. In fact, there are very many alternatives, and not all enable rumor spreading 
and plurality consensus to be solved. One of our contributions is to characterize noise matrices 
P = (pij), where pij is the probability that opinion i is switched to opinion j, for which these 
two problems are efficiently solvable. Similar issues arise for, e.g., redefining the majority bias 
into a plurality bias. 

The technical difficulties are manifold. A key ingredient of the analysis in m is a fine estimate 
of how nodes can mitigate the impact of noise by observing the opinions of many other nodes, 
and then considering the mode of such sample. Their proof relies on the fact that for the binary 
opinion case, given a sample of size 7 , the number of Is and Os in the sample sum up to 7 . Even 
for the ternary opinion case, the additional degree of freedom in the sample radically changes 
the nature of the problem, and the impact of noise is statistically far more difficult to handle. 
Also, to address the multivalued case, we had to cope with the fact that, in the uniform push 
model, the messages received by nodes at every round are correlated. To see why, consider an 
instance of the system in which a certain opinion b is held by one node only, and there is no 
noise at all. In one round, only one other node can receive b. It follows that if a certain node u 
has received b, no other nodes have received it. Thus, the messages each node receives are not 
independent. In |19| . probability concentration results are claimed for random variables (r.v.) 
that depend on such messages, using Chernoff bounds. However, Chernoff bounds have been 
proved to hold only for random variables whose stochastic dependence belongs to a very limited 
class (see for example m)- In |20j . it is stated that the binary random variables on which the 
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Chernoff bound is applied satisfy the property of being negatively 1-correlated (see Section 1.7 
in |20| for a formal definition). In our analysis, we show instead how to obtain concentration 
of probability in this dependent setting by leveraging Poisson approximation techniques. Our 
approach has the following advantage: instead of showing that the Chernoff bound can be 
directly applied to the specific involved random variables, we show that the execution of the 
given protocol, on the uniform push model, can be tightly approximated with the execution 
of the same protocol on a another suitable communication model, that is not affected by the 
stochastic correlation that affects the uniform push model. 

1.3 Other Related work 

By extending the work of CSl, we contribute to the theoretical understanding of how communica¬ 
tions and interactions function in biological distributed systems, from an algorithmic perspective 
la 0 El El El [25]. We refer the reader to |19) for a discussion on the computational aspects of 
biological distributed systems, an overview of the rumor spreading problem in distributed com¬ 
puting, as well as its biological significance in the presence of noise. In this section, we mainly 
discuss the previous technical contributions from the literature related to the novelty of our 
work, that is the extension to the case of several different opinions. 

In the context of population protocol, the problem of achieving majority consensus in the binary 
case has been solved by employing a simple protocol called undecided state dynamic [5]. In the 
uniform push model, the binary majority consensus problem can be solved very efficiently as a 
consequence of a more general result about computing the median of the initial opinions m- 
Still in the uniform push model, the undecided state dynamic has been analyzed in the case 
of an arbitrarily large number of opinions, which may even be a function of the number of 
agents in the system [8|. A similar result has been obtained for another elementary protocol, 
so-called 3-majority dynamics, in which, at each round, each node samples the opinion of three 
random nodes, and adopts the most frequent opinion among these three |9|. The 3-majority 
dynamics has also been shown to be fault-tolerant against an adversary that can change up to 
0{y/n) agents at each round [151 (9]. Other works have analyzed the undecided state dynamics 
in asynchronous models with a constant number of opinions HSlEaEZ], and the /i-majority 
dynamics (or slight variations of it) on different graph classes in the uniform push model |I3[ [T]. 
Note that the above protocols converge to the correct value with high probability. In [I], the 
tradeoff between deterministic success and convergence speed for protocols solving the binary 
majority consensus problem in population protocols is investigated. 

A general result by Kempe et al. |24j shows how to compute a large class of functions in the 
uniform push model. However, their protocol requires the nodes to send slightly more com¬ 
plex messages than their sole current opinion, and its effectiveness heavily relies on a potential 
function argument that does not hold in the presence of noise. 

To the best of our knowledge, we are the first considering the plurality consensus problem in 
the presence of noise. 


2 Model and formal statement of our results 

In this section we formally define the communication model, the main definitions, the investi¬ 
gated problems and our contribution to them. 

We do not provide a definition of what is a biologically feasible protocol, since the computational 
investigation with this respect is still too premature for such an attempt. As discussed in Section 
II.11 intuitively we look for protocols that are simple enough to be plausible communication 
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strategies for primitive biological system. As the reader can see in section 12.31 and 13.11 we 
consider a natural generalization of the protocol given in | 20 | . which is plainly an elementary 
combination of sampling and majority operations. 


2.1 Communication model and definition of the problems 


The communication model we consider is essentially the uniform push model [HISSIES], where 
in each (synchronous) round each agent can send (push) a message to another agent chosen 
uniformly at random. This occurs without having the sender or the receiver learning about each 
other’s identity. Note that it may happen that several agents push a message to the same node 
u at the same round. In the latter case we assume that the nodes receive them in a random 
order; for a detailed discussion regarding this assumption, we refer the reader to Appendix lAl 
We study the problems of rumor-spreading and plurality consensus. In both cases, we assume 
that nodes can support opinions represented by an integer in [k] = {!,..., A;}. Additionally, 
there may be undecided nodes that do not support any opinion, which represents nodes that are 
not actively aware that the system has started to solve the problem; thus, undecided nodes are 
not allowed to send any message before receiving any of them. 


• In rumor spreading, initially, one node, called the source, has an opinion m € {I,..., A:}, 
called the correct opinion. All the other nodes have no opinion. The objective is to design 
a protocol insuring that, after a certain number of communication rounds, every node has 
the correct opinion m. 

• In plurality consensus, initially, for every i G {I,... , A:}, a set A, of nodes have opinion i. 
The sets A,, i = I, ..., A:, are pairwise disjoint, and their union does not need to cover all 
nodes, i.e., there may be some undecided nodes with no opinion initially. The objective is 
to design a protocol insuring that, after a certain number of communication rounds, every 
node has the plurality opinion, that is, the opinion m with relative majority in the initial 
setting (i.e., \Am\ > \Aj\ for any j ^ m). 

Observe that the rumor-spreading problem is a special case of the plurality consensus problem 
with I Am I = I and \Aj\ =0 for any j 7 ^ m. 

Following the guidelines of |I9) . we work under two constraints: 


1. We restrict ourselves to protocols in which each node can only transmit opinions, i.e., every 
message is an integer in {I,... , A:}. 

2. Transmissions are subject to noise, that is, for every round, and for every node u, if an 

opinion A G {I,... , A;} is transmitted to node u during that round, then node u will receive 
message j G {I,..., A:} with probability pij > 0, where — f- 


The noisy push model is the uniform push model together with the previous two constraints. 
The probabilities {Pi,j}i,je[k] can be seen as a transition matrix, called the noise matrix, and 
denoted by P = {Pi,j)i^j^[k]- The noise matrix in |I9) is simply 


P = 




( 1 ) 


2.2 Plurality bias, and majority preservation 

When time proceeds, our protocols will result in the proportion of nodes with a given opinion 
to evolve. Note that there might be nodes who do not support any opinion at time t. As 
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mentioned in the previous section, we call such nodes undecided. We denote by the fraction 

of nodes supporting any opinion at time t and we call the nodes contributing to opinionated. 

Consequently, the fraction of undecided nodes at time t is 1 — Let be the fraction of 
opinionated nodes in the system that support opinion i S [k] at the beginning of round t, so that 
Sie[fc] ~ fraction of opinionated nodes which receive at least one message 

at time t — 1 and support opinion i G [k] at the beginning of round t. We write = {cf\ ..., 
to denote the opinion distribution of the opinions at time t. Similarly, let 
In particular, if every node would simply switch to the last opinion it received, then 

I Pr[received i \ original message is j]-Pr[original message is j] = c^^-pj^i. 

je[k] je[k] 


That is, 

E[c(*+^) I • P, (2) 

where P is the noise matrix. In particular, in the absence of noise, we have P = I (the 
identity matrix), and if every node would simply copy the opinion that it just received, we had 
jg[^(t+i) I (.(i)j = c(i). So, given the opinion distribution at round t, from the definition of the 
model it follows that the messages each node receives at round t+1 can equivalently be seen as 
being sent from a system without noise, but whose opinion distribution at round t is • P. 
Recall that m denotes the initially correct opinion, that is, the source’s opinion in the rumor¬ 
spreading problem, and the initial plurality opinion in the plurality consensus problem. The 
following definition naturally extends the concept of majority bias in |19| to plurality bias. 

Definition 1. Let d > 0. An opinion distribution c is said to be (5-biased toward opinion m if 
Cm — Ci > 5 for all i ^ m. 


In [TP] , each binary opinion that is transmitted between two nodes is flipped with probability at 
most \ — witlH e = for an arbitrarily small r/ > 0. Thus, the noise is parametrized by e. 

The smaller e, the more noisy are the communications. We generalize the role of this parameter 
with the following definition. 

Definition 2. Let e = e(n) and 5 = 5{n) be positive. A noise matrix P is said to be (e, 5)- 
majority-preserving ((e, (5)-m.p.) with respect to opinion m if for every opinion distribution c 
that is 5-biased toward opinion m, we have (c • P)^ — {c ■ P) i > e5 for all i m. 

In the rumor-spreading problem, as well as in the plurality consensus problem, when we say 
that a noise matrix is (e,(5)-m.p., we implicitly mean that it is (e, 5)-m.p. with respect to the 
initially correct opinion. Because of the space constraints, we defer the discussion on the class 
of (e, 5)-m.p. noise matrices in Section 0] (including its tightness w.r.t. theorems [T] and [ 2 ]) . 


2.3 Our formal results 

We show that a natural generalization of the protocol in |19) solves the rumor spreading problem 
and the plurality consensus problem for an arbitrary number of opinions k. More precisely, using 
the protocol which we describe in Section 13.11 we can establish the following two results, whose 
proof can be found in Section [3j 

^For a discussion on what happens for other values of e, see Appendix [D] 


6 



Theorem 1. Assume that the noise matrix P is (e, 5)-m.p. with e = 0 (n“ 4 +^) for an arbitrarily 
small constant r] > 0 and 5 = 0(y^log n/n). The noisy rumor-spreading problem with k opinions 
can be solved in communication rounds, w.h.p., by a protocol using 0(loglogn + log i) 

bits of memory at each node. 

Theorem 2. Let S with IS"! = n(^logn) be an initial set of nodes with opinions in [k], the 
rest of the nodes having no opinions. Assume that the noise matrix P is {e,6)-m.p. for some 
e > 0, and that S is Ll{^yiogn/\S\)-majority-biased. The noisy plurality consensus problem 
with k opinions can he solved in communication rounds, w.h.p., by a protocol using 

0 (loglogn + log bits of memory at each node. 

For k = 2, we get the theorems in |19j from the above two theorems. Indeed, the simple 2- 
dimensional noise matrix of Eq. ([T|) is e-majority-biased. Note that, as in |19| . the plurality 
consensus algorithm requires the nodes to known the size jSI of the set S of opinionated nodes. 


3 The Analysis 

In this section we prove Theorem [1] and Theorem [2] by generalizing the analysis of Stage 1 given 
in |19| and by providing a new analysis of Stage 2 . Note that the proof techniques required 
for the generalization to arbitrary k signihcantly depart from those in m for the case k = 2. 
In particular, our approach provides a general framework to rigorously deal with many kind of 
stochastic dependences among messages in the uniform push model. 

3.1 Definition of the Protocol 

We describe a rumor spreading protocol performing in two stages. Each stage is decomposed 
into a number of phases, each one decomposed into a number of rounds. During each phase of 
the two stages, the nodes apply the simple rules given below. 

3.1.1 The rule during each phase of Stage 1. 

Nodes that already support some opinion at the beginning of the phase push their opinion at 
each round of the phase. Nodes that do not support any opinion at the beginning of the phase 
but receive at least one opinion during the phase start supporting an omnion at the end of the 
phase, chosen u.a.r. (counting multiplicities) from the received opinionqj. In other words, each 
node tries to acquire an opinion during each phase of Stage 1, and, as it eventually receives some 
opinions, it starts supporting one of them (chosen u.a.r.) from the beginning of the next phase. 
In particular, opinionated nodes never change their opinion during the entire stage. 

More formally, let (p,fd, and s be three constants satisfying (f> > (d > s. The rounds of Stage 
1 are grouped in T -|- 2 phases with T = [log(n/(2s/e^ log n))/log(/3/e^ -|- 1)J. Phase 0 takes 
s/e^logn rounds, phase T -|- 1 takes ())/e^logn rounds, and each phase j with 1 < J < T takes 
/3/e^ rounds. We denote with tj the end of the last round of phase j. 

Let tu be the hrst time in which u receives any opinion since the beginning of the protocol (with 
tu = 0 for the source). Let ju be the phase of tu, and let val(tt) be an opinion chosen u.a.r. by u 

^Note that, in the protocol considered in m, the choice of each node’s new opinion in both stages is based on 
the first messages received. In m, in order to relax the synchronicity assumption that nodes share a common 
clock, they adopt the same sample-based variant of the rule that we adopt here. 
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among those that it receives during phase During the first stage of the protocol each node 
applies the following rule. 

Rule of Stage 1. Each opinionated node u pushes opinion val(rt) during each round of every 
phase j = ju + 1, ...,r + 1. 


3.1.2 The rule during each phase of Stage 2. 

During each phase of Stage 2, every node pushes its opinion at each round of the phase. At the 
end of the phase, each node that received “enough” opinions takes a random sampl^ of them, 
and starts supporting the most frequent opinion in that sample (breaking ties u.a.r.). 

More formally, the rounds of stage 2 are divided in T' + 1 phases with T' = [log(y^r/logn)]. 
Each phase j, 0 < j < T' — 1, has length 2i with i = [c/e^] for some large-enough constant 
c > 0, and phase T' has length 2^' with = 0{e~^ logn). Eor any finite multiset A of elements 
in {!,..., fe}, and any i £ let occ(i, A) be the number of occurrences of i in A, 

and let mode(A) = {i G {1,..., A:} | occ(f. A) > occ(j, A) for every j G {1,..., k}}. We then 
define maj(A) as the most frequent value in A (breaking ties u.a.r.), i.e., maj(A) is the r.v. on 
{l,...,/c} such that Pr(maj(A) = i) = l{igmode(A)}/l™ode(A)|. Let Rj{u) be the multiset of 
messages received by node u during phase j. During the second stage of the protocol each node 
applies the following rule. 

Rule of Stage 2. During each phase j of length 2L of Stage 2 {L = i or i'), each node u 
pushes its current opinion at each round of the phase, and starts drawing a random uniform 
sample S{u) of size L from Rj[u). Provided \Rj{u)\ > L, at the end of the phase u changes its 
opinion to maj(5(ri)). 

Let us remark that the reason we require the use of sampling in the previous rule is that at a 
given round a node may receive much more messages than 2L. Thus, if the nodes were to collect 
all the messages they receive, some of them would need much more memory than our protocol 
does. Finally, observe that overall both stages 1 and 2 take O(^logra) rounds. 

3.2 Pushing Colored Balls into Bins 

Before delving into the analysis of the protocol, we provide a framework to rigorously deal with 
the stochastic dependence that arises between messages in the uniform push model. Let process 
O be the process that results from the execution of the protocol of Section 13.11 in the uniform 
push model. In order to apply concentration of probability results that requires the involved 
random variables to be independent, we view the messages as balls, and the nodes as bins, 
and employ Poisson approximation techniques. More specifically, during each phase j of the 
protocol, let Mj be the set of messages that are sent to random nodes, and Nj be the set of 
messages sent after the noise has acted on them. (In other words, Nj = prove 

that, at the end of phase j, we can equivalently assume that all the messages Mj have been sent 
to the nodes according to the following process. 

Definition 3. The balls-into-bins process B associated to phase j is the two-step process in 
which the nodes represent bins and all messages sent in the phase represent colored balls, with 
each color corresponding to some opinion. Initially, balls are colored according to Mj. At the 
first step, each ball of color i G {1,... , A:} is re-colored with color j G {1,... , A:} with probability 

^Note that, in order to sample u.a.r. one of them, u does not need to collect all the opinions it receives. A 
natural sampling strategy such as reservoir sampling can be used. 






Pij, independently of the other balls. At the second step all balls are thrown to the bins u.a.r. 
as in a balls-into-bins experiment. 

Claim 1. Given the opinion distribution and the number of active nodes at the beginning of 
phase j, the probability distribution of the opinion distribution and the number of active nodes 
at the end of phase j in process O is the same as if the messages were sent according to process 

B. 

It is not hard to see that Claim [1] holds in the case of a single round. For more than one 
round, it is crucial to observe that the way each node u acts in the protocol depends only on the 
received messages Rj{u), regardless of the order in which these messages are received. As an 
example, consider the opinion distribution in which one node has opinion 1, one other node has 
opinion 2, and all other nodes have opinion 3. Suppose that each node pushes its opinion for 
two consecutive rounds. Since, at each round, exactly one opinion 1 and exactly one opinion 2 
are pushed, no node can receive two Is during the first round and then two 2s during the second 
round, i.e. no node can possibly receive the sequence of messages “1,1,2,2” in this exact order. 
Instead, in process B such a sequence is possible. 

Proof of Claim{Jl In both process B and process O, at each round, the noise acts independently 
on each ball/message of a given color/opinion, according to the same probability distribution 
for that color/opinion. Then, in both processes, each ball/message is sent to some bin/node 
chosen u.a.r. and independently of the other balls/messages. Indeed, we can couple process B 
and process O by requiring that: 

1 . each ball/message is changed by the noise to the same color/value, and 

2 . each ball/message ends up in the same bin/node. 

Thus, the joint probability distribution of the sets in process O is the same as the 

one given by process B. 

Observe also that, from the definition of the protocol (see the rule of Stage 1 and Stage 2 in 
Section lrTT) . it follows that each node’s action depends only on the set Rj{u) of received messages 
at the end of each phase j, and does not depend on any further information such as the actual 
order in which the messages are received during the phase. 

Summing up the two previous observations, we get that if, at the end of each phase j, we 
generate the Rj{u)s according to process B, and we let the protocol execute according to them, 
then we indeed get the same stochastic process as process O. □ 

Now, one key ingredient in our proof is to approximate process B using the following process P. 

Definition 4. Given Nj, process P associated to phase j is the one-shot process in which each 
node receives a number of opinions i that is a random variable with distribution Poisson(/ij/n), 
where hi is the number of messages in Nj carrying opinion z, and each Poisson random variable 
is independent of the others. 

Now we provide some results from the theory of Poisson approximation for balls-in-bins experi¬ 
ments that are used in Section o For a nice introduction to the topic, we refer to |26| . 

Lemma 1. Let be independent r.v. such that Xj ^Poisson[\j). The vector (Xi,Xa) 

conditional on X = rh follows a multinomial distribution with m trials and probabilities 

.^1 \ 

Tlj ’ ’ 12 j 
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Lemma 2. Consider a balls-in-bins experiment in which h colored balls are thrown in n bins, 
where hi balls have color i with i S {1,...,/?} and ~ n} ie{i k} 

the number of i-colored balls that end up in bin u, let f{xi^i,...,Xn,i,Xn, 2 ,---',Xn,n,zi,...,Zn) be 
a non-negative function with positive integer arguments xi^i, ...,Xn,i, Xn,2, ■■■TXn,n, zi, Zn, let 
n} ie{i k} independent r.v. such that Y^^i ^Poisson{hi/n) and let Z\,...,Zn be 
integer valued r.v. independent from the Xu,iS and Y^^iS. Then 


® [/ (^1,11 ■■■i^n,l-,Xn, 2 -, ■■■,Xn,n, Zi, ...,Zn)] 


<e^^f\^iE[f iYi^l,...,Yn,uYn,2,.:,Yn,n,Zu:.,Zn)]. 


Proof. To simplify notation^ let Z = (Zi,...,Z„), X = {Xi^i,..., Xn,i, Xn, 2 , ■■■, Xn,n), Y = 

a = (Ai,...,Afc) and 

finally x = (xi, ...,Xk) for any xi,...,Xk. Observe that, while Xu^i and X^^i are clearly depen¬ 
dent, Xu^i and Xyj with i ^ j are stochastically independent (even if u = v). Indeed, the 
distribution of the r.v. for each fixed i is multinomial with A* trials and uniform 

distribution on the bins. Thus, from Lemma [1] we have that distributed as 

conditional on Y2=i^u,i = K, that is 


E 


Therefore, we have 


f{Y,Z) 


'y ^ Yu,i Ai,..., 'y ^ El/j A/j 


11=1 


11=1 


= E[/ {X,Z)]. 


^[f{Y,Z)]= [/ I % = Pi' (% = 

>E[f {Y, Z) I fj; = A] Pr (fj; = A) = E [/ {X, Z)] Pr (% = A) 
= E [/ {V. Z)] n > E [/ (X, Z)] ^ 


hi 


where, in the last inequality, we use that, by Stirling’s approximation, a! < e^/a{-)°‘ for any 
a > 0. □ 


From Lemma [T] and Lemma [2l we get the following general result which says that if a generic 
event £ holds w.h.p in process P, it also holds w.h.p. in process O. 

Lemma 3. Given the opinion distribution and the number of active nodes at the beginning of 
a fixed phase j, let £ be an event that, at the end of that phase, holds with probability at least 
1 — n~^ in process P, for some b > (A;log/i)/(21ogn) with h = Then, at the end of 

phase j, £ holds w.h.p. also in process O. 

Proof. Thanks to Claim [T] it suffices to prove that, at the end of phase j, £ holds w.h.p. in 
process B. 

Let £ be the complementary event of £. Let h = \Mj\ be the number of balls that are 
thrown in process B associated to phase j, where hi balls have color i with i G k} and 

® Note that, if Nj is not yet fixed, the parameters hi of process P associated to phase j are random variables. 
However, if the opinion distribution and the number of active nodes at the beginning of phase j are given, then 
h = ffji = \Xj \ = \Mj \ is fixed. 
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'^^hi = h. Let n} ie{i k} number of i-colored balls that end up in bin u, 

let n} ie{i k} independent r.v. of process P such that ~Poisson(/ij/n) 

and let Zi, ...,Zn be integer valued r.v. independent from the X^^iS and Y^^iS. 

Fix any realization of Nj, i.e. any re-coloring of the balls in the first step of process B. By 
choosing / in Lemma [2] as the binary r.v. indicating whether event £ has occurred, where £ is 
a function of the r.v. Xi^i,Xn,i, Xn, 2 , ■■■, Xn,n, Zi,...,Zn, we get 


Pr [£ ...,Xn,n, -2^1) Zn) \ iVj) < 



Pr {£ (Pi,!, ..., Yn,n, Zu ..., Zn) I N,) . (3) 


Thus, from Eq. ([3]), the Inequality of arithmetic and geometric means and the hypotheses on 
the probability of £, we get 


Pr{£{Xi,U...,Xn,n,Zi,...,Zn)\Nj) <e\ Pr {£ (Tpi , ...,Yn,n, Zi, Z^) \ N,) 


< ' Pr {£ (Tpi,..., Yn,n, Z^) \ Nj) 

Finally, let Af be the set of all possible realizations of Nj. By the law of total probability over 
M, we get that 


^Pr {£{Xi^i,...,Xn,n,Zi,...,Zn)\Nj = s) Pi {Nj = s) 
seJV 

/ \ ~ 

Y.^^i^(^^Y-^Yn,n,Zi,...,Zn)\Nj=s) Pi (Nj = s) 

/ \ ~ 

< eM ^y Pr {£ (Tpi,..., Yn,n, Zi ,..., Z^)) 
k2 

where in the first inequality of the last line we used the hypotheses on the probability of £. □ 


We now analyze the two stages of our protocol, starting with Stage 1. Note that, in the following 
two sections, the statements about the evolution of the process refer to process O. 


3.3 Stage 1 

The rule of Stage 1 is aimed at guaranteeing that, w.h.p., the system reaches a target opin¬ 
ion distribution from which the rumor-spreading problem becomes an instance of the plurality 
consensus problem. More precisely, we have the following. 

Lemma 4. Stage 1 takes O(^logn) rounds, after which w.h.p. all nodes are active and c^’^r+i) 
is 5-biased toward the correct opinion, with 5 = Q,{^,J\ogn/n). 

Proof. The fact that an undecided node becomes opinionated during a phase only depends on 
whether it gets a message during that phase, regardless of the value of such messages. Hence, the 
proof that, w.h.p., = 1 is reduced to the analysis of the rule of Stage 1 as an information 

spreading process. First, by carefully exploiting the Chernoff bound and Lemma [3l we can 
establish Claim [2] and Claim [3] below: 
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Claim 2. W.h.p., at the end of phase 0, we have s/e^logn/3n < < s/e^logn/n. 

Claim 3. W.h.p., at the end of phase j, 1 < j <T, we have 

{p/e^ + 

Proof of Claiml^ and Claiml^ The probability that, in the process O, an undecided node be¬ 
comes opinionated at the end of phase j is 1 — (1 — where h is the number of messages sent 

_ h. ^ 

during that phase. In process P, this probability is 1 — e By using that ei+=' < 1 + x < 
for \x\ < 1 we see that 1 — < 1 — (1 — 1/n)^ < 1 — e~'^. Thus, we can prove Claim[2]and 

Claim [3] for process P by repeating essentially the same calculations as in the proofs of Claim 
2.2 and 2.4 in |20| . Since the Poisson distributions in process P are independent, we can apply 
the Chernoff bound as claimed in |20| . Finally, we can prove that the statements hold also for 
process O thanks to Lemma [2l □ 

From the previous two claims, and by the definition of T we get the following. 

Lemma 5. W.h.p., at the end of phase T, we have = Ct{{l3/e^ - 1 - 

Finally, from Lemma [5l an application of the Chernoff bound gives us the following. 

Lemma 6. W.h.p., at the end of Stage 1, all nodes are opinionated. 

As for the fact that, w.h.p., is a d-biased opinion distribution with 6 = fI(y^logn/n), we 

can prove the following. 

Lemma 7. W.h.p., at the end of each phase j of Stage 1, we have an {e/2y-biased opinion 
distribution. 


Proof. We prove the lemma by induction on the phase number. The case j = 1 is a direct 
application of Lemma fTHl to (* 7 ^ ^)) where the number of opinionated nodes is given 

by Claim [2l and, where the independence of the r.v. follows from the fact that each node that 
becomes opinionated in the first phase has necessarily received the messages from the source- 
node. Now, suppose that the lemma holds up to phase j — 1 < T. Let Sj = {u| ju = j} be the 
set of nodes that become opinionated during phase j. Recall the definition of Mj and Nj from 
Section [3.21 and observe that \Mj \ = lA^ I = (tj — tj-i) n ■ and that the number of times 

opinion i occurs in Mj is \Mj \ c\ ^ . Let us identify each message in Mj with a distinct number 

in 1, ..., \Mj\, and let |M |} binary r.v. such that Xw{i) = 1 if and only if 

w is i after the action of the noise. The frequency of opinion i in Nj is X^{i). 

Thanks to Lemma [HJ it suffices to prove the lemma for process P. By definition, in process P, 
for each i, the number of messages with opinion i that each node receives conditional on Nj 
follows a Poisson(^ {i)) distribution. Each node u that becomes opinionated during 

phase j gets at least one message during the phase. Thus, from Lemma [H the probability that 
u gets opinion i conditional on Nj is 


EL'S! w, (i) 

■spfc Y 

M,i = l /-JW = 1 


1 

Wo 


W = 1 


(i) 


Since opinionated nodes never change opinion during Stage 1, the bias of is at least the 
minimum between the bias of and the bias among the newly opinionated nodes in Sj. 
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Hence, we can apply the Chernoff bound to the nodes in Sj to prove that the bias at the end of 
phase j is, w.h.pjj. 


Pr 




> 



Xw (m) 

W = 1 


\N, 


W=1 


i^) 


1 — 5, 


(4) 


where dj = 0{^\ogn/\Sj\). 
Moreover, note that 


E 



W = 1 


-{■Tj-l) „( t 7 '- i ) 




Furthermore, (conditional on and the r.v. |jv^.|} are independent. 

Thus, for each i m, from Claim [3l and by applying the Chernoff bound on X^{m), and 

oil Y}w=iXwii), we get that w.h.p. 


1 


\N, 


Wj\ 

E Xu, (m) 

W=1 


, i«ji 

^ W = 1 


(5) 


where Sj = 0{^ylogn/\Nj\). 

From Claim [2] and Claim [3l it follows that 5j,5j < | w.h.p. Thus by putting together Eq. dl 
and ([5]) via the chain rule, we get that, w.h.p., 


c&> - 4’->' > 




□ 

Lemma [7] implies that, w.h.p., we get a bias = H(y^logn/n) at the end of Stage 1, which 
completes the proof of Lemma [H □ 


3.4 Stage 2 


As proved in the previous section, w.h.p., all nodes are opinionated at the end of Stage 1, and 
the final opinion distribution is H(y^log n/n)-biased. Now, we have that the rumor-spreading 
problem is reduced to an instance of the plurality consensus problem. The purpose of Stage 2 
is to progressively amplify the initial bias until all nodes support the plurality opinion, i.e. the 
opinion originally held by the source node. 

During the first T’ phases, it is not hard to see that, by taking c large enough, a fraction 
arbitrarily close to 1 of the nodes receives at least I messages, w.h.p. Each node u in such 
fraction changes its opinion at the end of the phase. With a slight abuse of notation, let 
maj£(u) = maj (5(tt)) be u’s new opinion based on the i = |5(u)| randomly sampled received 
messages. We show that, w.h.p., these new opinions increase the bias of the opinion distribution 
toward the plurality opinion by a constant factor > 1. 

For the sake of simplicity, we assume that ^ is odd (see Appendix O for details on how to remove 
this assumption). 


®We remark that Eq. Q concerns the value of Pr(c 





\Nj), which is a random variable. 
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Proposition 1. Suppose that, at the beginning of phase j of Stage 2 with 0 < j < T' — 1, the 
opinion distribution is 6-biased toward m. In process P, if a node u changes its opinion at the 
end of the phase, then, for any i ^ m, we have 


Pr = m) — Pr (maj£(u) = i) > 


yj- g(fc—2) ln4 ’ 


( 6 ) 


where 


g{54) = { 


j ( 5(1 — ( 5 ^) 2 




First, we prove Eq. ([ 6 ]) for k = 2. We then obtain the general case by induction. The proof for 
A: = 2 is based on a known relation between the cumulative distribution function of the binomial 
distribution, and the cumulative distribution function of the beta distribution. This relation is 
given by the following lemma. 

Lemma 8. Given p £ (0,1) and 0 < j < i it holds 

E =(i! i) (■>+r ‘ 

j<i<i ^ ^ \J / Jo 

Proof. By integrating by parts, for j < i — 1 we have 

J (j+ 1) 1 ^" (1 - dz =(^.^ (1 - 

^ - j - 1) J Z^^^ (1 - zY~^~^ dz 

{1 - 

yj + 2)l\^+^{l-zY-^-^dz, ( 7 ) 


J + I 2 


£ 

J + l 
e 

J + 2, 


where, in the last equality, we used the identity 




£ 

.i + 1 


(i + i). 


Note that when j = £ — 1, Eq. ([6|) becomes 


p'- = 


rp 

I / z^-^dz. 

Jo 

Hence, we can unroll the recurrence given by Eq. ([7]) to obtain 

G! 1 ) r'Z' 


j<i<^ 


£—i 


concluding the proof. 


□ 
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Lemma [5] allows us to express the survival function of a binomial sample as an integral. Thanks 
to it, we can prove Proposition [1] when k = 2. 


Lemma 9. Let c = (ci,C2) he a 5-biased opinion distribution during Stage 2. In process P, for 
any node u, we have Pr (maj£(u) = m) — Pr {ma,]i{u) = 3 — m) > \J2 £/tt ■ g (5, L). 

Proof. Without loss of generality, let m = 1. Let be a r.v. with distribution Bin{£,pi), 


and let = I — X'f’'. By using Lemma [HI we get 
Pr (maj^(u) = 1) - Pr (maj^(rr) = 2) = Pr > xf''^ - Pr 




E 


.. )piP2 * 


E 


, j/i V2 




E p5(i-p.)'-‘- E pL(i-p.)‘ 


rbUJ (1 

io 


111/ N I I 

2 ;L 2 J (1 - 2 ;)L 2 J dz 

11 \ 

— z)L2J . 


0 


By setting t = z — and rewriting pi = -\- 1 and p 2 = 22_Ei _|_ 1 obtain 

■pi 


Pr (maj£(«) = 1) - Pr = 2) = 


0 


r%LiJ (1 

Jo 


2;L2J (1 _ 2;)L2j dz 

“ I 

- z)^^i dz 


2 

-Pi -P2 


, Pi -P2 


1 




LIJ 


dt 


-r 


dt 


, Pi -P2 
2 


1 2A , 

, - — P 1 dt. 

PI -P2 V 4 


For any t e (—|, |) C (—2^-^, it holds 


LiJ /i-n 2 \LIJ 




Thus, for any y S (—Pi +P2jPi — P 2 ) we have 


“A 2 ^^’L /I-9 


2XUJ 


( 8 ) 


The r.h.s. of Eq. ([8]) is maximized w.r.t. y £ (—Pi +P2)Pi — P 2 ) when 


y = mm < pi - p 2 , 


niJ + i 


= mm<pi -p 2 , 


VI}' 
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Hence, for Pi — P2 < we get 


^/i ,nLIJ 

■^-t] dt>{pi-p 2 ) 


P1-P2 

2 


1 - (pi -P 2 ) 


2\ LIJ 


= 2 {pi - P2) (^1 - {pi - P2) 

= 2-^+^g{pi-p2,t}. 


-1 
2\ 2 


For Pi - P2 > we get 


/.‘LS !Li. / -I \ I 4 I 

I PI-P2 y4 y 


> 


i-e+i 


VI 


£-1 


l-j] = 2-^+V (pi - P 2 V) 


By using the fact that s' is a non-decreasing function w.r.t. its first argument, we obtain 


> 


> 



'f 

LmJ 

2 


'f 

LmJ 

2 


'f 

LmJ 

2 


- PI -P7. 

_P 12 ZP 2 . V 4 
2 


- r 


LIJ 


dt 


2 ^^^9{pi-P2V) 
2-^+^g{6,i). 


Finally, by using the bounds (see Lemma USD, and > 1 — a; together with the 

identitj0 


£ 


1 \i+l 

LK 

2 


(.+1 (9 \ t—1 

o / Vo 


= ()f. 


we get 



'f 

(mJ 

2 


2-^+ip(5,£) 


9 ^~l 2 

> • 2~^^^g{6,t) 


TT 


£-1 


>,/'“(!- 


vr V 9 (^- 1 ) 
Plf 

>\--g{SJ), 

V vr 


1 - ^ 


2 


5 (- 5 ,^) 


concluding the proof. 

Next we show how to lower bound the above difference with a much simpler expression. 


□ 


Lemma 10. In process P, during Stage 2, for any node u, Pr(maj£(u) = m) — Pr(maj^(u) = 
3 — m) is at least Pr(x|^^ > ..., — Pr(X,L^^ > where 

X(^) = (xj^\ ..., X^^^) follows a multinomial distribution with I trials and probability distribution 


c-P. 


"Recall that we are assuming that £ is odd. 
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Proof. Without loss of generality, let m = 1. Let x = (xi, ...,Xk) denote a generic vector with 
positive integer entries such that ~ VL(x) be the set of the greatest entries of x, 

and, for j G {!,*}, let 


• = {x| W(x) = {j}}, 

• = {x| G Id^(x)}, 

• = {x I 1 G W(x) A r 0 W(x) A |Ty(x)| >1} and 

• = {x I i G W(x) A 1 0 W(x) A |W(x)| > 1}. 


It holds 


Pr (maj^(M) = j) 


= E 

Pr = x) 

Pr (mai^{u) 

= j 

xW=x) 

xeAf 





+ E 

Pr = x^ 

Pr ^maj£(rt) 

= j 

xW = x) 

xeAj 

) 




+ E 

Pr = x^ 

Pr (^maj£(u) 

= j 

xW=x) 




= E 

Pr = x) 

+ E 

= x) 
LF(x)| 

xeAf 


xeA(. ) 




+ E 


Pr = x) 
|VK(x)| 


(9) 


Let 

C7(x) = {Xi,...,Xi-l,Xl,Xi+l, ...,Xk) 

be the vector function that swaps the entries xi and Xi in x. cr is clearly a bijection between 
the sets A^^ and A^\ A^~\ A^'^\ respectively, namely 

a : ^ Af\ a : ^4^“^ 

where denotes a bijection. 

Moreover, for all x G A^j~\ it holds 


Pr 


= X = Pr 


= cj (x)) . 


Therefore 


E 


= X = 


E p-' 


xe^i ^ (T{x)eAi i 


Furthermore, for all x G A^^\ we have 

Pr 


= ^ (^)) = 


= X 


( 10 ) 


= X = 


> 


Xl ... Xk 

i 

Xl ... Xk 


X^ Xo Xu 

Pi ■■■Pi -..Pk 


Xo X-\ 

Pi ■■■Pi 


Xu 

■Pk 


= Pr Xr^=a(x) , 


( 11 ) 
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where (t(x) G From Eq. m we thus have that 


Pr = xj > Y Pi' 

a{x)GA^^'^ 

From Eq. ©, COD and (1121) we finally get 


= w) = 


= X 


(#) 


Pr (maj^(u) = 1) — Pr (maj£(tt) = i) = Y^ Pi’ 


xSA 


(!) 


= X 


xsA! 


+ E 


Pr (XW = x) 


ceA' 


(=) 


|lP(x)| 


+ y y p, 

^ |lP(x)| ^ 




= X 


xeA, 


(!) 


Pr (XW = x) ^ Pr (XW = x) 


<eA) 


(=) 


xsA, 


(A) 


|lF(x)| 


> 


y Pr 


= X - 


y Pr 


= X 


xeA^'^ 


xsA! 


(!) 


= Pr 


= {xf}) - Pr 


( 12 ) 


= ] ; , 


concluding the proof of Lemma [101 


□ 


Intuitively, Lemma [TO] says that the set of events in which a tie occurs among the most frequent 
opinions in the node’s sample of observed messages does not favor the probability that the node 
picks the wrong opinion. Thus, by avoiding considering those events, we get a lower bound on 
Pr(maj£(u) = 1) — Pr(maj^(u) = i). 

Thanks to Lemma □roi the proof of Eq. ([^ reduces to proving the following. 

Lemma 11. For any fixed k, and with X defined as in Lemma [7^ we have 


Pr(y) > ...,XW) - Pr(y) > ...,Xy ,y|\, ...,xfP 


> vW 


TT 


gJM 

4^-2 ' 


(13) 


Proof. We prove Eq. m by induction. Lemma [9] provides us with the base case for k = 2. Let 
us assume that, for k < k, Eq. (1131) holds. For A: = k + 1, by using the law of total probability, 
we have 


£ E Pi'(v!'’>Af',...,4«i 

h=0 

L^J 

. V w V w V w 




XY = /i) Pr ( = h 


K-\-l 


.{£) _ 


Y Pr yP>yP,...,yP,Xa,...,X 


W 

/t+1 


h=0 


‘‘K + l 


4';i = '*)pd4?i = '* 


(14) 


Now, argmaxjlxj^^} = X^^'^ and xjfl_.^ < 
r.h.s. of Eq. (I14D . we have 


e 


together imply X^^^^ > ^+1. 


Thus, in the 


r(£) 


r(£) 




K+1 


X, 


(^) 

K+1 


= h] = Pr ( > X 


^(£) 


..,xW 


X 




= h 
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and 


/C+1 


= Pr (xf > Xi'>,..., 4'> 


= '! 


= '• 


Moreover, follows a multinomial distribution with parameters p and £. Thus = h 
implies that the remaining entries follow a multinomial distribution with I — h 

trials, and distribution f=^)' be the distribution of 

xf \ ...,X^'^^ conditional on X^'^ = h. From Eq. (fTHl we get 


Pr 


> A2 , ..., A^_^g - Pr ^^A. > A^ , ..., A._^, A.^^, ..., A, 


{(■) 

K+1 


> X; Pr(xf)>xf,...,xW 

h={) 

Lt^J 


X^!L =/i)Pr(X!/i =h 




.{e) 


Pr(xf ,...,XW 




Xi^}, =h]Prl Xl^’ = h 


^K+1 




^/C+1 


h=0 

L4tJ 

> 

h=0 

— Pr 


^ (pr (y/'-'*) > ..., yi'-^)) - 

(y/'-") > y/'-"\ ...,Yti'^\Y^-^\-.Yt^^)) Pr (xfj, = h) . 
Now, using the inductive hypothesis on the r.h.s. of Eq. m we get 


(15) 


L4tJ 


^ (pr ..., yj'-^)) 

(yi^-h) > yii-h)^ Pr [xi% = h) 


h=0 

— Pr 


L4tJ 


+ 1 - 

£ E 

/i=0 


2i-2hg {6, i-h) 


TT 4 

t 


AC—2 


Pr 1X^1 = ft 


> 




TT 4 


AC—2 


L4iJ 


h=0 


where, in the last inequality, we used the fact that 5 is a non-increasing function w.r.t. the 
second argument (see Lemma ESI). 

It remains to show that 

L4tJ 


h=0 


h 


E \/i-fpdyi='=y 


1 


Let be a r.v. with probability distribution Bin{i, Since X^^^^ ~ Bin{i,pK+i) with 

Pk-i -1 E a- standard coupling argument (see for example |17l Exercise 1.1.]), enables to show 
that 




Pr{xi^l,<h)>PT[W::_l,<h 


rW 
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Hence, we can apply the central limit theorem fLemma 11411 on and get that, for any 

~ ^ 2 —^ ^ there exists some fixed constant £o such that, for ^ > £o) we have 


Pr I Afi < 


K+1 


> Pr ( fPiJi < 


K+1 


> 


1 


— e 


By using Eq. (flGll . for i > Iq we finally get that 


L4tJ 


h=0 




> 


1 

e 

K+l 


i 

/ 

2 

1 - 


K + 1 

/K-1 (1 


L4tJ 

E p--( 


4?i = 


h=0 


<;rM 


K + 1 ■ V2 - Va ■ (2 ^)-r 


1 


concluding the proof that 

Pr (maj^(ri) = 1) — Pr (maj£(rt) = i) > 


yj- g(fc—2) ln4 ' 


(16) 


□ 


By using Proposition [H we can then prove Lemma [12] 

Lemma 12. W.h.p., at the end of Stage 2, all nodes support the initial plurality opinion. 


Proof. Let 6 = Q{\/logn/n) be the bias of the opinion distribution at the beginning of a generic 
phase j < T' of Stage 2. Thanks to Proposition (T] by choosing the constant c of the phase 
length large enough, in process P we get that Pr(maj£(u) = m) — Pr (maj£(tt) = i) > a5 for 
some constant a > 1 (provided that 5 < 1/2). Hence, by applying Lemma [TUI in Appendix [B] 
with 9 = jd, we get Pr(cm ~ ^ — Oid/2) < exp(—(ah)^n/16) < for some constant a 

that is large enough to apply Lemma [2] Therefore, until 5 > 1/2, in process P we have that 
Cm ~ c^ > a5l2 holds w.h.p. From the previous equation it follows that, after T' phases, the 
protocol has reached an opinion distribution with a bias greater than 1/2. Thus, by a direct 
application of Lemma fTHl and Lemma[2jto — c['^'\ we get that, w.h.p., = 1, 

concluding the proof. □ 


Finally, the time efficiency claimed in Theorem[Tjand Theorem [2] directly follows from Lemma fT^ 
while the required memory follows from the fact that in each phase each node needs only to 
count how many times it has received each opinion, i.e. to count up to at most 0{-^ logn) w.h.p. 


4 On the Notion of (e, ^)-Majority-Preserving Matrix 

In this section we discuss the notion of (e, h)-m.p. noise matrix introduced by Definition [2] Let 
us consider Eq. ©■ The matrix P represents the “perturbation” introduced by the noise, and 
so (c ■ P)m — (c ■ P)i measures how much information the system is losing about the correct 
opinion m, in a single communication round. An (e, (5)-m.p. noise matrix is a noise matrix that 
preserves at least an e fraction of bias, provided the initial bias is at least 6. The (e, h)-m.p. 
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property essentially characterizes the amount of noise beyond which some coordination problems 
cannot be solved without further hypotheses on the nodes’ knowledge of the matrix P. To see 
why this is the case, consider an (e, 5)-m.p. noise matrix for which there is a 5-biased opinion 
distribution c such that (c • P)^ — (c • P)i < 0 for some opinion i. Given opinion distribution c, 
from each node’s perspective, opinion m does not appear to be the most frequent opinion. Indeed, 
the messages that are received are more likely to be i than m. Thus, plurality consensus cannot 
be solved from opinion distribution c. 

Observe that verifying whether a given matrix P is (e, (5)-m.p. with respect to opinion m consists 
in checking whether for each i ^ m the value of the following linear program is at least e5: 

maximize {P ■ c)m — {P ■ c)j 

subject to = 1, 

j 

and Vj, Cj > 0,Cm — Cj — 6 > 0. 


We now provide some negative and positive examples of (e, 5)-m.p. noise matrices. First, we 
note that a natural matrix property such as being diagonally dominant does not imply that the 
matrix is (e, (5)-m.p. For example, by multiplying the following diagonally dominant matrix by 
the (5-biased opinion distribution c = (1/2 + 5,1/2 — 5,0)''', we see that it does not even preserve 
the majority opinion at all when e, 5 < 1/6: 



0 


0 




+ e 


On the other hand, the following natural generalization of the noise matrix in m (see Eq. 
is (e, 5)-m.p. for every <5 > 0 with respect to any opinion: 


~ Pi,3 ~ 


+ e 

_ _ e 

k k—1 

More generally, let P be a noise matrix such that 



if i = j, 
otherwise. 


iP) 




P 

qi < Qij < qu 


for some positive numbers p, Qu and qi. Since 


if i = j, 
otherwise. 


{PP)m iP^)i — PCm T ^ ^ qjt^^j P^i ^ ^ qj,i^j 

j^m j^i 

> p{cm - p) + qpj - ^ quCj 

3¥=rn j^i 

> p{cm - Ci) + qi{l - Cm) - g«(l - Cj) 

> p{cm - Ci) +qi- qiCm -qu + quCi 

> p{cm - Ci) - qu{Cm “ p) - {qu - qi) 
>{p- qu){cm - Ci) - {qu - qi) 

> {p-qu)S - {qu - qi)- 


(17) 


(18) 


By defining e = {p—qu)/2, we get that the last line in Eq. (fTHl) is greater than ed iff {p — qu)d/2 > 
{qu — qi), which gives a sufficient condition for any matrix of the form given in Eq. (I17p for being 
(e, (5)-m.p. 
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5 Conclusion 


In this paper, we solved the general version of rumor spreading and plurality consensus in 
biological systems. That is, we have solved these problems for an arbitrarily large number k of 
opinions. We are not aware of realistic biological contexts in which the number of opinions might 
be a function of the number n of individuals. Nevertheless, it could be interesting, at least from 
a conceptual point to view, to address rumor spreading and plurality consensus in a scenario 
in which the number of opinions varies with n. This appears to be a technically challenging 
problem. Indeed, extending the results in the extended abstract of |20| from 2 opinions to any 
constant number k of opinions already required to use complex tools. Yet, several of these tools 
do not apply if k depends on n. This is typically the case of Proposition [T] We let as an open 
problem the design of stochastic tools enabling to handle the scenario where k = k{n). 

Acknowledgments. We thank the anonymous reviewers of an earlier version of this work for their 
constructive criticisms and comments, which were of great help in improving the results and their pre¬ 
sentation. 
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APPENDIX 


A The Reception of Simultaneous Messages 

In the uniform push model, it may happen that several agents push a message to the same node 
u at the same round. In such cases, the model should specify whether the node receives all such 
messages, only one of them or neither of them. Which choice is better depends on the biological 
setting that is being modeled: if the communication between the agents of the system is an 
auditory or tactile signal, it could be more realistic to assume that simultaneous messages to 
the same node would “collide”, and the node would not be able to grasp any of them. If, on the 
other hand, the messages represent visual or chemical signals (see e.g. [301 EB [ini [9]), then it 
may be unrealistic to assume that nodes cannot receive more than one of such messages at the 
same round and besides, by a standard balls-into-bins argument (e.g. by applying Lemma [3]), 
it follows that in the uniform push model at each round no node receives more than Oilogn) 
messages w.h.p. In this work we thus consider the model in which all messages are received, also 
because such assumption allows us to obtain simpler proofs than the other variants. We finally 
note that our protocol does not strictly need such assumption, since it only requires the nodes 
to collect a small random sample of the received messages. However, since we look at the latter 
feature as a consequence of active choices of the nodes rather than some inherent property of 
the environment, we avoid to weaken the model to the point that it matches the requirements 
of the protocol. 


B Technical Tools 


Lemma 13. For any integer r >1 it holds 


2 ^” ^ f2r\ 22” ^ 

-e9r < < —-- eSr . 

/vrr V r / WTrr 


Proof. By using Stirling’s approximation [29 


\/27rr ei2r+i < r! < \/2 
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;7rr ( - 1 ei2^ , 


we have 
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1 1 

The proof of the upper bound is analogous (swap and in the first inequality). 

Lemma 14. Let Xi ,..., be a random sample from a Bernoulli{p) distribution with p £ (0, 1) 
constant, and let Z ~ iV(0,1). It holds 


lim 

n^oo . 


sup 


I Ei=i ^ pn < ^ ) _ Pr ( ^ < 
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Yp0--p) 


= 0 . 


25 























Lemma 15. The function 


g{x,y) = < 


X 


il-x-) 


1 - 


\ y 




with X G [0,1] and y G [l,+oo) is non-decreasing w.r.t. x and non-increasing w.r.t. y. 
Proof. To show that g{x, y) is non-decreasing w.r.t. x, observe that 


A 

dx 

for X < and 


g (x, y) = ^(l - x^) 2 - 2x2 (^T~) ) 


(l — x^) 2 _ 2x^ ~ ^ ^ 

for X < y~2. 

To show that g{x,y) is non-increasing w.r.t. y, observe that this is true for x < y~ 2 . For 
_ 1 

X > y 2 , since 


»-i 


a 


dy 


y -1 


— log y 2 H-— log 1-= TT 
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y 


d fy-1 


dy y 2 ■ (y - 1 ) - I log y ) < 0, 


we have 
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d 


y-1 


(a;, y) = exp Oog y 2 H -— log 1-K - 0’ 


1 


dy dy 

concluding the proof. 

Lemma 16. Let {Xt} be n i.i.d. random variables such that 

1 with probability p, 

= < 0 with probability r, 

^—1 with probability q. 


□ 


with p -\- r -\- q = 1. It holds 
Pr 

Proof. Let us define the r.v. 


E^. 


— 6n\ < exp I — — I E 




-|- n 


Yt = 


Xt + l 


( 19 ) 


We can apply the Chernoff-Hoeffding bound to Yt (see Theorem 1.1 in HZI), obtaining 

02 


Pr < (1 - 0)E 


Er. 


< exp I -yE 
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for any 6 G (0,1). Substituting Eq. (|19l) we have 


Pr ( + n < (1 - 6») (e 




+ n ) ) = Pr ( < (1-0)E 

E^^ 




— On 


< exp (-Me 


+ n 


concluding the proof. 
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C Removing the Parity Assumption on I 

The next lemma shows that, for k = 2, the increment of bias at the end of each phase of Stage 2 
in the process P is non-decreasing in the value of i, regardless of its parity. In particular, since 
Proposition [T] is proven by induction, and since the value of i affects only the base case, the next 
lemma implies also the same kind of monotonicity for general k. 

Lemma 17. Let k = 2, a = 1, let £ be odd, and let (c • P)i > (c • P) 2 - The rule of Stage 2 of 
the protocol is such that 

Pr (maj£(u) = 1) = Pr (maj£+i(u) = l) < Pr (maj£+ 2 (w) = l) , 

Pr (maj^(u) = 2) = Pr (maj^+i(u) = 2) > Pr (maj£+ 2 (^) = 2) . (20) 

Proof. To simplify notation, let pi = {c ■ P)i and p 2 = {c ■ P) 2 - By definition, we have 

'f 
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Pr (maj£(u) = 1) = Pr > 

Pr (maj^_,_;^(tt) = l) = Pr > 

Pr (maj£+ 2 (^) = l) = Pr > 


'+1 

2 

T + 2 


^ Pr 


where X^\ and are binomial r.v. with probability pi and number of trials £, 

I and i p2, respectively. We can view X^\ xf^^\ and as the sum of ^ 1 

and £ + 2 Bernoulli{pi) r.v., respectively. In particular, let Y and Y be independent r.v. with 
distribution Bernoulli{pi). We can couple X^\ and as follows: 




(^+1) 


= Wf ^ + Y 


and 


^((.+ 2 ) _ j^(£+l) _|_ y ' 


Since £ is odd, observe that if X^'^ > |~|], then maj^(ri) = 1 regardless of the value of Y, and 
similarly if xf'^ < |~fl then maj^(u) = 2. Thus we have 
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As for the last two terms in the previous equation, we have that 


Pr maj^+i(u) = 1 


xf ^ = 


= Pr(y = 1)+Pr(y = 0)-, 


and 
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Pr ^maj^+i(u) = 1 
Moreover, by a direct calculation one can verify that 


= Pr(y = l)-. 


Pr ( X[^'^ = 


Pr (y = 0) / 

Pr(y = l) “ 


( 22 ) 


(23) 


(24) 


From Eq. (f22]l . (l23]l and 
Pr ^maj^+i(u) = 1 
+ Pr ^maj^+i(u) = 1 


it follows that 


xf ^ = 
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By plugging Eq. (f2^ in Eq. (1^ we get 

Pr {maj(,{u) = 1) = Pr (maj£+i(u) = l) . 

The proof that 

Pr (maj^(u) = 2) = Pr (maj£+i('u) = l) 
is analogous, proving the first part of Eq. (|20p . 

As for the second part, observe that if xj^"*"^^ > then maj£_|_ 2 (ri) = 1 regardless of the value 
of Y', and similarly if then maj£_|_ 2 (u) = 2. Observe also that 


Pr ^maj^+ 2 («) = 1 
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= Pr (y = 1) = Pi. 
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Because of the previous observations and the hypothesis that pi > we have that 
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(26) 


The proof of 


Pr (maj^+ 2 (''r) = 2) < Pr (maj£+ 2 (^r) = 2) 


is the same up to the inequality in (f26l) . whose direction is reversed because P2 < k- 


□ 


D Rumor spreading with e = 0(n 4 ’i) 

In |20| it is shown that at the end of Stage 1 the bias toward the correct opinion is at least e^~^‘^j2 
and, at the beginning of Stage 2, they assume a bias toward the correct opinion of n(Y^logn/n). 
In this section, we show that, when e = Q{n~i~'^) for some rj G (0,1/4), the protocol considered 
by |20| and us cannot solve the rumor-spreading and the plurality consensus problem in time 
©(logn/e^). 

First, observe that when e = 0(y^log n/n) the length of the first phase of Stage 1 is 0 (logre/e^) = 
n(nlogn), which implies that, w.h.p., each node gets at least one message from the source dur¬ 
ing the first phase. Thus, thanks to our analysis of Stage 2 we have that when e = 0(y^log n/n) 
the protocol effectively solves the rumor-spreading problem, w.h.p., in time 0(logre/e^). 

In general, for e < for some constant ry > 0, if we adopt the second stage right from the 

beginning (which means that the source node sends e~^ messages), we get that, w.h.p., all nodes 
receive at least logn/(e^n) messages. Thus, by a direct application of Lemma fTHl after the hrst 
phase we get an y^logn/n-biased opinion distribution, w.h.p., and Stage 2 correctly solves the 
problem according to Theorem [2J 

However, when e = for some ry > 0, from Claim [2] and Lemma [7] we have that, after 

phase 0 in opinion distribution c, at most O (logn/e^) = 0 (n 2 +^^ log n) nodes are opinionated, 
and c is |-biased. Each node that gets opinionated in phase 1 receives a message pushed from 
some node of c, and, because of the noise, the value of this message is distributed according to 
• P. It follows that c is an e^/2-biased opinion distribution with = n“ 2 “^^ which is much 
smaller than the Q{^/logn/n) bound required for the second stage. 

We believe that no minor modification of the protocol proposed here can correctly solve the 
noisy rumor-spreading problem when e = in time O (logn/e^). 
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